Multi-label sampling based on local label imbalance

نویسندگان

چکیده

• The local imbalance is more crucial than the global one in multi-label data. based measure assesses hardness of MLSOL and MLUL tackle class issue via imbalance. Suitable application situations our two methods are identified, respectively. Class an inherent characteristic data that hinders most learning methods. One efficient flexible strategy to deal with this problem employ sampling techniques before training a model. Although existing approaches alleviate datasets, it actually level within neighbourhood minority examples plays key role performance degradation. To address issue, we propose novel assess label as well approaches, namely Multi-Label Synthetic Oversampling on Local (MLSOL) Undersampling (MLUL). By considering all informative labels, creates diverse better labeled synthetic instances for difficult examples, while eliminates harmful their region. Experimental results 13 datasets demonstrate effectiveness proposed variety evaluation metrics, particularly case ensemble classifiers trained repeated samples original

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Towards Label Imbalance in Multi-label Classification with Many Labels

In multi-label classification, an instance may be associated with a set of labels simultaneously. Recently, the research on multi-label classification has largely shifted its focus to the other end of the spectrum where the number of labels is assumed to be extremely large. The existing works focus on how to design scalable algorithms that offer fast training procedures and have a small memory ...

متن کامل

Multi-label LeGo - Enhancing Multi-label Classifiers with Local Patterns

The straightforward approach to multi-label classification is based on decomposition, which essentially treats all labels independently and ignores interactions between labels. We propose to enhance multilabel classifiers with features constructed from local patterns representing explicitly such interdependencies. An Exceptional Model Mining instance is employed to find local patterns represent...

متن کامل

Towards Class-Imbalance Aware Multi-Label Learning

In multi-label learning, each object is represented by a single instance while associated with a set of class labels. Due to the huge (exponential) number of possible label sets for prediction, existing approaches mainly focus on how to exploit label correlations to facilitate the learning process. Nevertheless, an intrinsic characteristic of learning from multi-label data, i.e. the widely-exis...

متن کامل

Multi-Label Learning with Global and Local Label Correlation

It is well-known that exploiting label correlations is important to multi-label learning. Existing approaches either assume that the label correlations are global and shared by all instances; or that the label correlations are local and shared only by a data subset. In fact, in the real-world applications, both cases may occur that some label correlations are globally applicable and some are sh...

متن کامل

Declaratively Capturing Local Label Correlations with Multi-Label Trees

The goal of multi-label classification is to predict multiple labels per data point simultaneously. Real-world applications tend to have high-dimensional label spaces, employing hundreds or even thousands of labels. While these labels could be predicted separately, by capturing label correlation we might achieve better predictive performance. In contrast with previous attempts in the literature...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Pattern Recognition

سال: 2022

ISSN: ['1873-5142', '0031-3203']

DOI: https://doi.org/10.1016/j.patcog.2021.108294